PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400041776
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family HD-ZIP
Protein Properties Length: 306 aa    MW: 34482.8 Da    pI: 8.0917
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
PGSC0003DMP400041776  genome  PGSC    View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     59.7   4.7e-19  135    189  2          56
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           rk+ ++tkeq  +Le+ F+ +++++ +++  LAk+lgL  rqV vWFqNrRa+ k
  PGSC0003DMP400041776 135 RKKLRLTKEQSAVLEDSFKDHHTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTK 189
                           678899***********************************************98 PP

2    HD-ZIP_I/II  126.7  9.9e-41  135    224  1          91
           HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLre 89 
                           +kk+rl+keq+++LE+sF+ +++L+p++K +la++Lgl+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l++en+rL+kev+eLr 
  PGSC0003DMP400041776 135 RKKLRLTKEQSAVLEDSFKDHHTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCVENLTDENRRLQKEVQELR- 222
                           69*************************************************************************************9. PP

           HD-ZIP_I/II  90 el 91 
                           +l
  PGSC0003DMP400041776 223 SL 224
                           55 PP
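
The alignment blocks above appear to follow the usual HMMER-style layout: a CS (consensus structure) line, the model consensus, a match line, the query sequence, and a PP (posterior probability) line. As a minimal sketch under that assumption, the snippet below counts the columns of the Homeobox alignment where the query residue equals the model consensus residue, and cross-checks the count against the letters printed in the match line. The function name `count_identities` is hypothetical and not part of any PlantTFDB or HMMER tooling.

```python
# Rows copied from the Homeobox alignment (model positions 2-56,
# query positions 135-189); this block contains no gap characters.
consensus = "rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
match_line = "rk+ ++tkeq  +Le+ F+ +++++ +++  LAk+lgL  rqV vWFqNrRa+ k"
query = "RKKLRLTKEQSAVLEDSFKDHHTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTK"


def count_identities(model: str, target: str) -> int:
    """Count aligned columns where both rows hold the same residue.

    The comparison ignores case, because the consensus row uses case
    to mark conservation rather than residue identity.
    """
    return sum(
        1
        for m, t in zip(model, target)
        if m.isalpha() and t.isalpha() and m.upper() == t.upper()
    )


identical = count_identities(consensus, query)

# Assumption: in the match line a letter marks an exact match to the
# consensus, "+" a positive-scoring substitution, and a space anything else.
letters_in_match_line = sum(ch.isalpha() for ch in match_line)

print(f"{identical} of {len(query)} columns identical to the consensus")
print("agrees with match line:", identical == letters_in_match_line)
```

The same function can be applied to the HD-ZIP_I/II rows, provided gap characters (`-`) are kept in place so that the columns stay aligned.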

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Pfam             PF04618           4.1E-31   1      108  IPR006712    HD-ZIP protein, N-terminal
SuperFamily      SSF46689          1.04E-18  126    192  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.022    131    191  IPR001356    Homeobox domain
SMART            SM00389           1.1E-17   133    195  IPR001356    Homeobox domain
Pfam             PF00046           2.0E-16   135    189  IPR001356    Homeobox domain
CDD              cd00086           2.80E-16  135    192  No hit       No description
Gene3D           G3DSA:1.10.10.60  3.1E-18   135    187  IPR009057    Homeodomain-like
PRINTS           PR00031           3.1E-5    162    171  IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         166    189  IPR017970    Homeobox, conserved site
PRINTS           PR00031           3.1E-5    171    187  IPR000047    Helix-turn-helix motif
CDD              cd14686           0.00435   184    223  No hit       No description
Gene3D           G3DSA:1.20.5.170  8.5E-4    188    224  No hit       No description
SMART            SM00340           2.3E-25   191    234  IPR003106    Leucine zipper, homeobox-associated
Pfam             PF02183           4.5E-10   191    225  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 306 aa
MMGKEDLGLS LSLSFSAEKR TTTINLIPST IISPSSAFNN NYWTTHPPFP HSSPDLNTEA  60
CRLVETKSFL KGIDVNRMPA TAEEEEGGVS SPNSTISSLS GNKRSEREGN CTEENEMERA  120
SSRGISDEED GETCRKKLRL TKEQSAVLED SFKDHHTLNP KQKLALAKRL GLRPRQVEVW  180
FQNRRARTKL KQTEVDCEFL KRCVENLTDE NRRLQKEVQE LRSLKHSPQF YMQMTPPTTL  240
TMCPSCEHVA TGPTNTPVNI PPHRVGPPHQ HHQPMPLNMW GPSSTPISQG HYGQMDTYPT  300
FARQK*
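
The Start and End columns of the Signature Domain table can be checked against this sequence. The following is a minimal sketch that strips the grouping spaces, position counters, and trailing `*` from the block above, then slices out the Homeobox domain, assuming the domain coordinates are 1-based and inclusive. The helper names `clean_sequence` and `slice_1based` are hypothetical.

```python
import re

# Sequence block as displayed on the page, including the 10-residue
# grouping, the running position counters, and the "*" stop marker.
RAW = """
MMGKEDLGLS LSLSFSAEKR TTTINLIPST IISPSSAFNN NYWTTHPPFP HSSPDLNTEA  60
CRLVETKSFL KGIDVNRMPA TAEEEEGGVS SPNSTISSLS GNKRSEREGN CTEENEMERA  120
SSRGISDEED GETCRKKLRL TKEQSAVLED SFKDHHTLNP KQKLALAKRL GLRPRQVEVW  180
FQNRRARTKL KQTEVDCEFL KRCVENLTDE NRRLQKEVQE LRSLKHSPQF YMQMTPPTTL  240
TMCPSCEHVA TGPTNTPVNI PPHRVGPPHQ HHQPMPLNMW GPSSTPISQG HYGQMDTYPT  300
FARQK*
"""


def clean_sequence(raw: str) -> str:
    """Keep residue letters only; drops digits, whitespace, and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, with both bounds 1-based and inclusive."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)

# Coordinates taken from row 1 of the Signature Domain table.
homeobox = slice_1based(protein, 135, 189)

print(len(protein))
print(homeobox)
```

The slice should reproduce the query row of the Homeobox alignment. The cleaned string excludes the trailing `*` and holds 305 residues, so the reported length of 306 presumably counts that marker.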
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    183    191  RRARTKLKQ
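
The page does not say which indexing convention the NLS coordinates use. A minimal sketch for locating the motif in the protein sequence follows; the variable names are illustrative only. In this check the motif is found at 0-based offset 183, which corresponds to residues 184 to 192 in 1-based numbering. This suggests that the NLS table is 0-based while the Signature Domain table is 1-based, but that is an inference from this check, not something the page states.

```python
# Protein sequence from the Sequence section, written in the same
# 10-residue groups and joined with the spaces removed.
GROUPED = """
MMGKEDLGLS LSLSFSAEKR TTTINLIPST IISPSSAFNN NYWTTHPPFP HSSPDLNTEA
CRLVETKSFL KGIDVNRMPA TAEEEEGGVS SPNSTISSLS GNKRSEREGN CTEENEMERA
SSRGISDEED GETCRKKLRL TKEQSAVLED SFKDHHTLNP KQKLALAKRL GLRPRQVEVW
FQNRRARTKL KQTEVDCEFL KRCVENLTDE NRRLQKEVQE LRSLKHSPQF YMQMTPPTTL
TMCPSCEHVA TGPTNTPVNI PPHRVGPPHQ HHQPMPLNMW GPSSTPISQG HYGQMDTYPT
FARQK
"""
protein = "".join(GROUPED.split())

nls = "RRARTKLKQ"  # sequence listed in the NLS table

# str.find returns the 0-based offset of the first match, or -1 if absent.
offset = protein.find(nls)
if offset == -1:
    raise ValueError("NLS motif not found in the protein sequence")

zero_based = (offset, offset + len(nls) - 1)   # inclusive bounds
one_based = (offset + 1, offset + len(nls))    # inclusive bounds

print("0-based inclusive:", zero_based)
print("1-based inclusive:", one_based)
```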
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975518  1e-162   HG975518.1 Solanum lycopersicum chromosome ch06, complete genome.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_006367464.1        0.0      PREDICTED: homeobox-leucine zipper protein HAT4
Swissprot  P46600                2e-95    HAT1_ARATH; Homeobox-leucine zipper protein HAT1
TrEMBL     M1C8G7                0.0      M1C8G7_SOLTU; Uncharacterized protein
STRING     PGSC0003DMT400062081  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA1118              24           85
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G17460.1  1e-96    Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein
Publications
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]